Embedded flash memory device with floating gate embedded in a substrate

ABSTRACT

An embedded flash memory device includes a gate stack, which includes a bottom dielectric layer extending into a recess in a semiconductor substrate, and a charge storage layer over the bottom dielectric layer. The charge storage layer includes a portion in the recess. The gate stack further includes a top dielectric layer over the charge storage layer, and a metal gate over the top dielectric layer. Source and drain regions are in the semiconductor substrate, and are on opposite sides of the gate stack.

BACKGROUND

Flash memories, which use dielectric trapping layers or floating layers to store charges, are often used in System-On-Chip (SOC) technology, and are formed on the same chip along with other integrated circuits. For example, High-Voltage (HV) circuits, Input/output (IO) circuits, core circuits, and Static Random Access Memory (SRAM) circuits are often integrated on the same chip as the flash memories. The respective flash memories are often referred to as embedded memories since they are embedded in the chip on which other circuits are formed, as compared to the flash memories formed on chips that do not have other circuits. Flash memories have structures different from HV circuit devices, IO circuit devices, core circuit devices, and SRAM circuit devices. Therefore, the embedding of memory devices with other types of devices faces challenges when the technology evolves.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the embodiments, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:

FIGS. 1 through 18 are cross-sectional views of intermediate stages in the manufacturing of embedded memory devices and other types of devices in accordance with some exemplary embodiments;

FIGS. 19 and 20 are cross-sectional views of intermediate stages in the manufacturing of embedded memory devices in accordance with some exemplary embodiments, wherein the charge storage layers of a plurality of embedded memory devices are formed in discrete recesses; and

FIGS. 21 and 22 are cross-sectional views of intermediate stages in the manufacturing of embedded memory devices in accordance with some exemplary embodiments, wherein the charge storage layers of a plurality of embedded memory devices are formed in a same continuous recess.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

The making and using of the embodiments of the disclosure are discussed in detail below. It should be appreciated, however, that the embodiments provide many applicable concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed are illustrative, and do not limit the scope of the disclosure.

An embedded memory device and the methods of forming the same are provided in accordance with various exemplary embodiments. The intermediate stages of forming the embedded memory device are illustrated. The variations of the embodiments are discussed. Throughout the various views and illustrative embodiments, like reference numbers are used to designate like elements.

Referring to FIG. 1, semiconductor substrate 20, which is a part of semiconductor wafer 2, is provided. In some embodiments, semiconductor substrate 20 includes crystalline silicon. Other commonly used materials such as carbon, germanium, gallium, boron, arsenic, nitrogen, indium, phosphorus, and/or the like, may also be included in semiconductor substrate 20. Semiconductor substrate 20 may be a bulk substrate or a Semiconductor-On-Insulator (SOI) substrate. In some exemplary embodiments, semiconductor substrate 20 comprises Si_(1-z)Ge_(z), wherein value z is the atomic percentage of germanium in SiGe, and may be any value ranging from 0 to 1, inclusive. For example, when value z is 0, semiconductor substrate 20 comprises a crystalline silicon substrate. When value z is 1, semiconductor substrate 20 comprises a crystalline germanium substrate. Substrate 20 may also have a compound structure including a III-V compound semiconductor on a silicon substrate, or a silicon germanium (or germanium) layer on a silicon substrate.

Semiconductor substrate 20 includes portions in regions 100, 200, 300, and 400. In accordance with some embodiments, regions 100, 200, 300, and 400 include an embedded flash memory region, a High-Voltage (HV) region, an Input/output (IO) region, and a Static Random Access Memory (SRAM) region/general logic device region, respectively. Embedded flash memory region 100 is used for forming embedded flash memory cells (such as 156 in FIGS. 18, 20, and 22) therein. HV region 200 is used for forming HV devices (such as 256 in FIG. 18) therein. IO region 300 is used for forming IO devices (such as 356 in FIG. 18) therein. Core/SRAM region 400 is used for forming core devices and/or SRAM cells (such as 456 in FIG. 18) therein. The core devices, sometimes referred to as logic devices, do not include any memory array therein, and may be, or may not be, in the peripheral region of SRAM arrays. For example, the core devices may be in the driver circuit or the decoder circuit of the SRAM arrays (in region 400) or the flash memory array in region 100. The HV devices are supplied with, and are configured to endure, a positive power supply voltage Vdd1 higher than the positive power supply voltage Vdd2 of the devices in SRAM/core region 400. For example, power supply voltage Vdd2 may be lower than about 1V, and power supply voltage Vdd1 may be between about 1.5V and about 3.3V. Although portions of substrate 20 in regions 100, 200, 300, and 400 are shown as disconnected, they are portions of the same continuous substrate 20.

Referring to FIG. 2, recess 4 is formed in substrate 20, for example, by etching substrate 20. Depth D1 of recess 4 is close to the thickness of the charge storage layer 10 (FIG. 5) that is to be formed in recess 4 in a subsequent step. In some exemplary embodiments, depth D1 is between about 100 nm and about 200 nm, although different depths may be adopted.

As shown in FIG. 3, bottom dielectric layer 6 is formed on substrate 20. In some embodiments, bottom dielectric layer 6 is formed of silicon oxide, which may be formed by performing a thermal oxidation on substrate 20. In alternative embodiments, bottom dielectric layer 6 comprises silicon oxynitride or other dielectric materials that have low leakage of charges. In some embodiments, thickness T1 of bottom dielectric layer 6 is between about 20 Å and about 50 Å. It is appreciated, however, that the values recited throughout the description are merely examples, and may be changed to different values. In alternative embodiments, bottom dielectric layer 6 is formed through deposition. Bottom dielectric layer 6 may be a conformal layer with the vertical portions and horizontal portions having similar thicknesses, for example, with differences smaller than 20 percent of either one of the thicknesses of the vertical portions and horizontal portions.
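The conformality criterion above can be illustrated with a minimal sketch. The function name, the reading of "either one" as "both" thicknesses, and the sample values are assumptions made for illustration and are not part of the disclosure.

```python
def is_conformal(t_vertical, t_horizontal, tolerance=0.20):
    """Return True if the thickness difference is below `tolerance`
    (20 percent by default) of both the vertical and horizontal thickness."""
    diff = abs(t_vertical - t_horizontal)
    # Assumption: "either one" is read as "both", which reduces to
    # comparing the difference against the smaller of the two thicknesses.
    return diff < tolerance * min(t_vertical, t_horizontal)


# Hypothetical thicknesses in angstroms, chosen within the 20-50 range for T1.
print(is_conformal(30, 33))  # difference of 3 against a limit of 6
print(is_conformal(30, 40))  # difference of 10 against a limit of 6
```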

Referring to FIG. 4, blanket charge storage layer 8 is formed. In some embodiments, charge storage layer 8 is formed of a conductive material such as polysilicon, metal, or the like. In alternative embodiments, charge storage layer 8 is formed of a dielectric material with a high trap density. In some exemplary embodiments, charge storage layer 8 comprises silicon nitride (SiN). Charge storage layer 8 fills the unfilled portion of recess 4.

Next, referring to FIG. 5, a planarization such as a Chemical Mechanical Polish (CMP) is performed to remove excess portions of charge storage layer 8. The remaining portion of charge storage layer 8 is referred to as charge storage layer 10 (sometimes referred to as a floating gate) hereinafter. During the CMP, the portions 6A of bottom dielectric layer 6, which portions are over substrate 20, are used as a CMP stop layer. Accordingly, the top surface of charge storage layer 10 is coplanar with the top surface of portions 6A of bottom dielectric layer 6. After the CMP, the top surface 10A of charge storage layer 10 is slightly higher than top surfaces 20B of the portions of substrate 20 in regions 200/300/400, with height difference ΔH being between about 5 nm and about 50 nm, for example. In alternative embodiments, the top surface 10A of charge storage layer 10 is slightly lower than top surfaces 20B of the portions of substrate 20 in regions 200/300/400. The majority of charge storage layer 10 may be embedded in substrate 20, with a small portion over substrate 20. For example, height difference ΔH may be smaller than about 40 percent of thickness H1 of charge storage layer 10.
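As a simple worked example of the relation between height difference ΔH and thickness H1, the sketch below computes the fraction of the charge storage layer that protrudes above the substrate. The function name and the sample values (H1 of 150 nm, ΔH of 20 nm) are assumptions for illustration only.

```python
def protruding_fraction(delta_h_nm, h1_nm):
    """Fraction of charge storage layer thickness H1 that lies above the substrate."""
    if h1_nm <= 0:
        raise ValueError("H1 must be positive")
    return delta_h_nm / h1_nm


# Hypothetical values: H1 comparable to the 100-200 nm recess depth D1,
# delta H within the 5-50 nm example range.
fraction = protruding_fraction(delta_h_nm=20.0, h1_nm=150.0)
mostly_embedded = fraction < 0.40  # the "smaller than about 40 percent" example
print(round(fraction, 3), mostly_embedded)
```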

FIG. 6 illustrates the formation of top dielectric layer 12, which may be a single layer or a composite layer. In some embodiments, top dielectric layer 12 is a single layer, which may be a silicon oxide layer, a silicon oxynitride layer, or the like. In alternative embodiments, top dielectric layer 12 is a composite layer comprising a plurality of dielectric layers. For example, FIG. 6 illustrates that dielectric layer 12 has a triple-layer structure, which may include an Oxide-Nitride-Oxide (ONO) structure, with layers 22, 24, and 28 being a silicon oxide layer, a silicon nitride layer, and a silicon oxide layer, respectively.

Referring to FIG. 7, bottom dielectric layer 6 and top dielectric layer 12 are patterned in an etching step. The portions of bottom dielectric layer 6 and top dielectric layer 12 in regions 200, 300, and 400 are removed. The portions of bottom dielectric layer 6 and top dielectric layer 12 in region 100 are left in place. After the patterning, as shown in FIG. 8, HV dielectric layer 26 is formed in regions 200, 300, and 400. Thickness T2 of HV dielectric layer 26 may be between about 50 Å and about 300 Å.

In accordance with some embodiments, HV dielectric layer 26 is formed using thermal oxidation by oxidizing substrate 20. Accordingly, HV dielectric layer 26 is formed in regions 200, 300, and 400, and not in region 100. In alternative embodiments, HV dielectric layer 26 is formed using a Chemical Vapor Deposition (CVD) method such as Plasma Enhanced CVD (PECVD), Low Pressure CVD (LPCVD), Atomic Layer Deposition (ALD), or the like. In these embodiments, HV dielectric layer 26 may comprise silicon oxide, silicon oxynitride, or the like. The dielectric constant of the HV dielectric layer 26 and dielectric layer 28 may be about 3.8 in some embodiments.

As shown in FIG. 9, HV dielectric layer 26 is patterned, and is removed from regions 300 and 400. Next, referring to FIG. 10, IO dielectric layer 30 is formed. In some embodiments, IO dielectric layer 30 comprises silicon oxide. Alternatively, IO dielectric layer 30 comprises silicon oxynitride. Thickness T3 of IO dielectric layer 30 may be between about 20 Å and about 70 Å, which may be smaller than thickness T2 of HV dielectric layer 26 in some embodiments. Similarly, IO dielectric layer 30 may be formed through thermal oxidation of substrate 20, deposition, or the like. After the formation of IO dielectric layer 30, IO dielectric layer 30 is removed from region 400.

Referring to FIG. 11, interfacial layer 32 is formed on substrate 20. Interfacial layer 32 may comprise a chemical oxide, a thermal oxide, or the like. In some embodiments, interfacial layer 32 is formed by oxidizing the exposed surface portion of substrate 20. In alternative embodiments, interfacial layer 32 is formed by treating the surface portion of substrate 20 using a chemical, for example, an oxidant such as ozone water or hydrogen peroxide. The resulting interfacial layer 32 is referred to as a chemical oxide layer, which comprises silicon oxide. Thickness T4 of interfacial layer 32 may be between about 8 Å and about 20 Å, which may be smaller than thickness T3 of IO dielectric layer 30 in some embodiments.
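The thickness relations among HV dielectric layer 26, IO dielectric layer 30, and interfacial layer 32 can be summarized in a short sketch. The ranges are the example ranges recited above; the chosen values and all identifiers are hypothetical and serve only to show one selection that satisfies T2 > T3 > T4.

```python
# Example thickness ranges in angstroms, as recited in the description.
RANGES = {
    "T2": (50, 300),  # HV dielectric layer 26
    "T3": (20, 70),   # IO dielectric layer 30
    "T4": (8, 20),    # interfacial layer 32
}


def in_range(name, value):
    """Check a chosen thickness against its recited example range."""
    low, high = RANGES[name]
    return low <= value <= high


# Hypothetical selection, not taken from the disclosure.
chosen = {"T2": 150, "T3": 40, "T4": 12}

all_in_range = all(in_range(name, value) for name, value in chosen.items())
ordered = chosen["T2"] > chosen["T3"] > chosen["T4"]
print(all_in_range, ordered)
```

Because the T2 and T3 ranges overlap, as do the T3 and T4 ranges, the ordering is an additional constraint in the embodiments that adopt it rather than a consequence of the ranges alone.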

Referring to FIG. 12, high-k dielectric layer 34, capping layer 36, and dummy gate layer 38 are formed sequentially, and are formed in regions 100, 200, 300, and 400 simultaneously. Accordingly, each of layers 34, 36, and 38 has a same thickness and a same material in regions 100, 200, 300, and 400. Dummy gate layer 38 may be formed of polysilicon in some exemplary embodiments. High-k dielectric layer 34 may have a k value greater than about 7.0, and may include an oxide or a silicate of Hf, Al, Zr, La, Mg, Ba, Ti, Pb, Yb, Pr, Nd, Gd, Er, Dy, or combinations thereof. Exemplary materials of high-k dielectric layer 34 include MgO_(x), BaTi_(x)O_(y), BaSr_(x)Ti_(y)O_(z), PbTi_(x)O_(y), PbZr_(x)Ti_(y)O_(z), and the like, with values x, y, and z being between 0 and 1. The thickness of high-k dielectric layer 34 may be between about 0.5 nm and about 10 nm. The formation methods of high-k dielectric layer 34 may include Molecular-Beam Deposition (MBD), Atomic Layer Deposition (ALD), Physical Vapor Deposition (PVD), and the like.

Over high-k dielectric layer 34, capping layer 36 may be formed. In some embodiments, capping layer 36 comprises titanium nitride (TiN). In alternative embodiments, the exemplary materials of capping layer 36 include tantalum-containing materials and/or titanium-containing materials such as TaC, TaN, TaAlN, TaSiN, and combinations thereof. Dummy gate layer 38 is then formed over capping layer 36.

FIGS. 13 through 18 illustrate the formation of devices in regions 100, 200, 300, and 400 using a gate-last approach, wherein the gates of the devices are referred to as replacement gates. Referring to FIG. 13, layers 12, 26, 30, 32, 34, 36, and 38 are patterned, forming layer stacks 140, 240, 340, and 440 in regions 100, 200, 300, and 400, respectively. After the patterning, lightly doped source and drain regions (not shown) and/or pocket regions (not shown) may be formed adjacent to any one or all of layer stacks 140, 240, 340, and 440.
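The composition of the gate stacks in the four regions at this stage can be sketched as a small data structure. This is a simplified reading of the description: it assumes that each non-memory region retains only the dielectric named for that region, and it lists bottom dielectric layer 6 and charge storage layer 10 as part of the memory gate stack. The identifiers are hypothetical.

```python
# Layers formed simultaneously in all regions (FIG. 12), listed bottom to top.
COMMON_LAYERS = ["high-k dielectric 34", "capping layer 36", "dummy gate 38"]

# Region-specific layers under the common layers, listed bottom to top.
# Assumption: each non-memory stack keeps only the dielectric named for its region.
REGION_LAYERS = {
    "region 100 (flash)": ["bottom dielectric 6", "charge storage 10", "top dielectric 12"],
    "region 200 (HV)": ["HV dielectric 26"],
    "region 300 (IO)": ["IO dielectric 30"],
    "region 400 (core/SRAM)": ["interfacial layer 32"],
}


def build_stack(region_layers):
    """Return the full gate stack for one region, bottom to top."""
    return list(region_layers) + COMMON_LAYERS


STACKS = {name: build_stack(layers) for name, layers in REGION_LAYERS.items()}

for name, layers in STACKS.items():
    print(name, "->", " / ".join(layers))
```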

Next, referring to FIG. 14, gate spacers 42 are formed on the sidewalls of layer stacks 140, 240, 340, and 440. In some embodiments, gate spacers 42 comprise silicon nitride, although other dielectric materials may also be used. The formation of gate spacers 42 includes forming a blanket layer(s), and performing an anisotropic etching to remove the horizontal portions of the blanket layer. The remaining portions of the blanket layer form gate spacers 42.

FIG. 15 illustrates the formation of source and drain regions 44, which are alternatively referred to as source/drain regions 44 hereinafter. Source/drain regions 44 may be formed through implantation or epitaxy. The formation details of source/drain regions 44 are not discussed herein.

FIG. 16 illustrates the formation of Inter-Layer Dielectric (ILD) 46, which is formed of a dielectric material such as Phospho-Silicate Glass (PSG), Boro-Silicate Glass (BSG), Boron-Doped Phospho-Silicate Glass (BPSG), or the like. ILD 46 has a top surface higher than the top surface of layer stacks 140, 240, 340, and 440. A CMP may then be performed to level the top surface of ILD 46 and the top surfaces of the layer stacks, as shown in FIG. 17.

Referring to FIG. 18, the remaining portions of polysilicon layer 38 (FIG. 17) are removed, for example, through etching, and are replaced with replacement gates. The replacement gates include metal gate electrodes 152, 252, 352, and 452. Metal gate electrodes 152, 252, 352, and 452 may have a single layer structure or a multi-layer structure including a plurality of layers, which is schematically illustrated using reference notations 148 and 150. Metal gate electrode 152 forms the gate electrode of embedded flash memory 156. Metal gate electrode 252 forms the gate electrode of HV device (transistor) 256. Metal gate electrode 352 forms the gate electrode of IO device (transistor) 356. Metal gate electrode 452 forms the gate electrode of core or SRAM device (transistor) 456. Gate electrodes 152, 252, 352, and 452 may comprise metal or metal alloys such as Cu, W, Co, Ru, Al, TiN, TaN, TaC, combinations thereof, and multi-layers thereof. As shown in FIG. 18, the top surface of metal gate 152 is coplanar with the top surfaces of metal gates 252, 352, and 452 due to the CMP. The bottom surface of metal gate 152 is higher than the bottom surfaces of metal gates 252, 352, and 452.

In subsequent steps, contact openings (not shown) are formed in ILD 46, exposing underlying source/drain regions 44. Source/drain silicides and source/drain contact plugs (not shown) may be formed to electrically couple to source/drain regions 44. The formation of memory device 156, HV transistor 256, IO transistor 356, and core/SRAM transistor 456 is thus finished.

In memory region 100, there may be a plurality of memory devices having the same structure, for example, the structure of memory device 156 in FIG. 18. The plurality of memory devices 156 may be arranged as an array including a plurality of rows and columns of the flash memory devices. FIG. 19 illustrates a cross-sectional view of device region 100, in which a plurality of memory devices 156 is to be formed. In accordance with some embodiments, in the recessing of substrate 20, which recessing step is shown in FIG. 2, discrete recesses 4 are formed. The discrete recesses 4 may form an array in the top view of the structure in FIG. 19. Each of the recesses 4 is used to form the charge storage layer of one of the embedded flash memory devices. The portions of substrate 20 between discrete recesses 4 are not etched, and hence have top surfaces 20A higher than the bottom surfaces of recesses 4.

In subsequent steps in accordance with these embodiments, the process steps shown in FIGS. 3 through 18 are performed to form a plurality of memory devices 156, and the resulting structure is shown in FIG. 20. Devices 256, 356, and 456 are not shown in FIG. 20, and are the same as in FIG. 18. As shown in FIG. 20, charge storage layers 10 and the respective bottom dielectric layers 6 are formed in discrete recesses 4 (FIG. 19) in substrate 20. Substrate 20 thus includes un-etched portions on opposite sides of, and adjacent to, each of charge storage layers 10. In these embodiments, in device region 100, some portions of substrate 20 between neighboring devices 156 may have top surfaces 20A (also shown in FIG. 18) that are coplanar with the top surfaces 20B (FIG. 18) of the portions of substrate 20 in regions 200, 300, and 400.

In accordance with alternative embodiments, instead of forming discrete recesses in which to place the charge storage layers, the portions of the semiconductor substrate between the recesses 4 in which charge storage layers 10 are formed are also etched. Hence, the entirety of the substrate 20 in device region 100, at which a memory array is to be formed, is recessed. FIG. 21 illustrates a cross-sectional view of device region 100 and recess 4, in which a plurality of memory devices 156 is to be formed. In accordance with some embodiments, in the recessing of substrate 20, which step is shown in FIG. 2, a block of substrate in device region 100 is recessed. Dashed line 20B illustrates where the top surface of substrate 20 was before the recessing. The level represented by 20B is also the level of the top surfaces of the portions of substrate 20 in regions 200, 300, and 400 (FIG. 18). The recessed top surface of the portion of substrate 20 in region 100 is marked as 20A, which is lower than 20B.

In subsequent steps in accordance with these embodiments, the process steps shown in FIGS. 3 through 18 are performed to form a plurality of memory devices 156, and the resulting structure is shown in FIG. 22. Devices 256, 356, and 456 are not shown in FIG. 22, and are the same as in FIG. 18. As shown in FIG. 22, charge storage layers 10 and the respective bottom dielectric layers 6 are formed in recess 4 that extends throughout a plurality of memory devices 156. Substrate 20 in these embodiments does not include portions on opposite sides of, and adjacent to, each of charge storage layers 10. Rather, in device region 100, charge storage layers 10 and bottom dielectric layers 6 are over top surface 20A, which is lower than top surfaces 20B of the portions of substrate 20 in regions 200/300/400 (FIG. 18).

In accordance with the embodiments of the present disclosure, in the embedded flash memory 156 (FIGS. 18, 20, and 22), floating gates are formed at least partially in substrate 20. Because floating gates are thick, if they were formed over the substrate, the gate stacks of the embedded flash memory devices would be much higher than the gate stacks of other transistors such as HV transistors, IO transistors, and core/SRAM transistors. This causes process difficulty. For example, the CMP in the formation of replacement gates could not be performed, because the CMP might remove the entire dummy gates of the embedded flash memory devices. By embedding the floating gates of the flash memory devices in the substrate, the heights of the gate stacks of the flash memory devices are reduced, and the subsequent CMP may be performed.
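The CMP concern described above can be illustrated numerically. The sketch below estimates how much dummy gate material remains in the memory gate stack after all stacks are planarized to one common height, once with the floating gate entirely over the substrate and once with it embedded. The planarized height and the combined blocking-layer thickness are assumed values that do not come from the disclosure; H1 and ΔH are picked from the example ranges given earlier.

```python
def remaining_dummy_gate(planarized_height_nm, below_gate_height_nm):
    """Dummy gate thickness left after CMP levels every stack to one height.

    Heights are measured from substrate top surface 20B. A result of 0.0
    means the dummy gate would be removed entirely by the CMP.
    """
    return max(0.0, planarized_height_nm - below_gate_height_nm)


PLANARIZED_HEIGHT_NM = 80.0  # assumed common stack height after the CMP
H1_NM = 150.0                # assumed floating gate thickness, comparable to depth D1
DELTA_H_NM = 20.0            # assumed protrusion, within the 5-50 nm example range
BLOCKING_NM = 15.0           # assumed top dielectric + high-k + capping thickness

# Floating gate formed entirely over the substrate.
on_substrate = remaining_dummy_gate(PLANARIZED_HEIGHT_NM, H1_NM + BLOCKING_NM)
# Floating gate embedded, protruding only by delta H.
embedded = remaining_dummy_gate(PLANARIZED_HEIGHT_NM, DELTA_H_NM + BLOCKING_NM)

print(on_substrate, embedded)  # 0.0 45.0
```

Under these assumed numbers, the non-embedded floating gate leaves no dummy gate to be replaced, while the embedded floating gate leaves 45 nm.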

In addition, high-k dielectric layer 34 is formed over the top dielectric layer 12 to form the blocking layer of the resulting embedded flash memory 156. With the dual layer structure of the blocking layer, the thickness of the high-k dielectric and the top dielectric layer may be reduced without sacrificing the charge retention ability of the memory devices. On the other hand, with the formation of the metal gates in the memory device 156, the mismatch between the threshold voltages of different embedded flash memory devices is reduced. This is advantageous for the formation of flash memory devices having different threshold voltage levels. With small mismatch, different levels of threshold voltages may be clearly distinguished from each other.

In accordance with some embodiments, an embedded flash memory device includes a gate stack, which includes a bottom dielectric layer extending into a recess in a semiconductor substrate, and a charge storage layer over the bottom dielectric layer. The charge storage layer includes a portion in the recess. The gate stack further includes a top dielectric layer over the charge storage layer, and a metal gate over the top dielectric layer. Source and drain regions are in the semiconductor substrate, and are on opposite sides of the gate stack.

In accordance with other embodiments, a gate stack of an embedded flash memory device includes a bottom silicon oxide layer extending on sidewalls and a bottom of a recess in the semiconductor substrate, and a charge storage layer over the bottom silicon oxide layer. A majority of the charge storage layer is embedded in the recess. The gate stack further includes a top oxide layer over the charge storage layer, a high-k dielectric layer over and contacting the top oxide layer, a metal capping layer over and contacting the high-k dielectric layer, and a metal gate over the high-k dielectric layer.

In accordance with yet other embodiments, a method includes recessing a semiconductor substrate to form a recess in a device region of the semiconductor substrate, forming a bottom dielectric layer, wherein the bottom dielectric layer extends on sidewalls and a bottom surface of the recess, forming a charge storage layer over the bottom dielectric layer, wherein a portion of the charge storage layer is in the recess, forming a top dielectric layer over the charge storage layer, forming a metal gate over the top dielectric layer, and forming source and drain regions in the semiconductor substrate and on opposite sides of the charge storage layer.

Although the embodiments and their advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the embodiments as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, and composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed, that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized according to the disclosure. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps. In addition, each claim constitutes a separate embodiment, and the combination of various claims and embodiments is within the scope of the disclosure.

What is claimed is:
 1. A device comprising: a semiconductor substrate; and an embedded flash memory device comprising: a first gate stack comprising: a bottom dielectric layer extending into a recess in the semiconductor substrate; a charge storage layer over the bottom dielectric layer, wherein the charge storage layer comprises a first portion in the recess; and a second portion out of the recess; a top dielectric layer over the charge storage layer; and a first metal gate over the top dielectric layer; and first source and drain regions in the semiconductor substrate, wherein the first source and drain regions are isolated from each other by a channel of the embedded flash memory device, and the first source and drain regions extend from a top surface of the semiconductor substrate into the semiconductor substrate.
 2. The device of claim 1 further comprising a first high-k dielectric layer over the top dielectric layer and underlying the first metal gate.
 3. The device of claim 2, wherein the top dielectric layer comprises: a first oxide layer; a nitride layer over the first oxide layer; and a second oxide layer over the nitride layer.
 4. The device of claim 2 further comprising: a non-memory transistor comprising a second gate stack, wherein the second gate stack comprises: a dielectric layer over the semiconductor substrate; a second high-k dielectric layer over the dielectric layer, wherein the first high-k dielectric layer and the second high-k dielectric layer are formed of a same material, and have a same thickness; and a second metal gate over the second high-k dielectric layer, wherein the first metal gate and the second metal gate are formed of a same material, and have a same thickness.
 5. The device of claim 1 further comprising a first metal capping layer overlying the top dielectric layer and underlying the first metal gate, with the first metal capping layer in physical contact with the first metal gate.
 6. The device of claim 1, wherein the embedded flash memory device is comprised in a memory array comprising a plurality of embedded flash memory devices, wherein the semiconductor substrate comprises an intermediate portion between, and coplanar with, charge storage layers of two neighboring ones of the plurality of embedded flash memory devices.
 7. The device of claim 1, wherein the embedded flash memory device is comprised in a memory array comprising a plurality of embedded flash memory devices, wherein between charge storage layers of two neighboring ones of the plurality of embedded flash memory devices, the semiconductor substrate does not include any portion therein.
 8. A device comprising: a semiconductor substrate; an embedded flash memory device comprising a first gate stack, wherein the first gate stack comprises: a bottom silicon oxide layer extending on sidewalls and a bottom of a recess in the semiconductor substrate; a charge storage layer over the bottom silicon oxide layer, wherein a lower portion of the charge storage layer is embedded in the recess; and an upper portion of the charge storage layer is higher than a top surface of the semiconductor substrate; a top oxide layer over the charge storage layer; a first high-k dielectric layer over and contacting the top oxide layer; a first metal capping layer over and contacting the first high-k dielectric layer; and a first metal gate over and contacting the first metal capping layer; and a source region and a drain region in the semiconductor substrate, wherein the source region and the drain region are isolated from each other by a channel of the embedded flash memory device, and the source region and the drain region extend from the top surface of the semiconductor substrate into the semiconductor substrate.
 9. The device of claim 8, wherein the bottom silicon oxide layer comprises a horizontal portion over and contacting the semiconductor substrate, and wherein the horizontal portion of the bottom silicon oxide layer, the first high-k dielectric layer, and the first metal capping layer are co-terminus.
 10. The device of claim 8 further comprising a non-memory transistor, wherein the non-memory transistor comprises a second gate stack comprising: an oxide layer over the semiconductor substrate; a second high-k dielectric layer over and contacting the oxide layer, wherein the first and the second high-k dielectric layers have a same thickness, and are formed of a same material; a second metal capping layer over and contacting the second high-k dielectric layer, wherein the first and the second metal capping layers have a same thickness, and are formed of a same material; and a second metal gate over the second metal capping layer, wherein a top surface of the first metal gate is coplanar with a top surface of the second metal gate, and a bottom surface of the first metal gate is higher than a bottom surface of the second metal gate.
 11. The device of claim 10, wherein the non-memory transistor is a High-Voltage (HV) transistor in a high voltage device region.
 12. The device of claim 10, wherein the non-memory transistor is an Input/Output (IO) transistor in an Input/Output (IO) device region.
 13. The device of claim 10, wherein the non-memory transistor is a core transistor in a core device region.
 14. The device of claim 10, wherein the first and the second metal capping layers comprise titanium nitride.
 15. A device comprising: a semiconductor substrate comprising a first top surface and a second top surface higher than the first top surface; a first embedded flash memory device comprising a first gate stack, wherein the first gate stack comprises: a bottom dielectric layer overlapping the first top surface; a charge storage layer over the bottom dielectric layer, wherein the charge storage layer comprises a first portion lower than the second top surface, and a second portion higher than the second top surface; a top dielectric layer over the charge storage layer; a gate over the top dielectric layer; and a source region and a drain region in the semiconductor substrate, wherein the source region and the drain region are isolated from each other by a channel of the first embedded flash memory device, and the source region and the drain region extend from the second top surface into the semiconductor substrate; a non-memory transistor comprising: a gate dielectric overlapping the second top surface of the semiconductor substrate; and a gate electrode over the gate dielectric, wherein a top surface of the gate of the first embedded flash memory device is coplanar with a top surface of the gate electrode of the non-memory transistor.
 16. The device of claim 15 further comprising a second embedded flash memory device comprising: an additional bottom dielectric layer overlapping a third top surface of the semiconductor substrate, wherein the third top surface is coplanar with the first top surface; and an additional charge storage layer overlapping the additional bottom dielectric layer, wherein an entire top surface of an entirety of the portion of the semiconductor substrate between the first and the second embedded flash memory devices is coplanar with the first top surface.
 17. The device of claim 15 further comprising a second embedded flash memory device comprising: an additional bottom dielectric layer overlapping a third top surface of the semiconductor substrate, wherein the third top surface is coplanar with the first top surface; and an additional charge storage layer overlapping the additional bottom dielectric layer, wherein an intermediate portion of the semiconductor substrate between the first and the second embedded flash memory devices comprises a top surface coplanar with the second top surface.
 18. The device of claim 17, wherein the bottom dielectric layer extends on a sidewall of the intermediate portion of the semiconductor substrate.
 19. The device of claim 1, wherein an entirety of the first source region is on an opposite side of the first gate stack than an entirety of the first drain region, with the first source region and the first drain region being at substantially a same level.
 20. The device of claim 1, wherein the bottom dielectric layer extends from the top surface of the semiconductor substrate into the semiconductor substrate. 